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In the presence of large uncertainties, a control system needs to be able to adapt rapidly to regain per- 
formance. Fast adaptation is referred to the implementation of adaptive control with a large adaptive gain to 
reduce the tracking error rapidly. However, a large adaptive gain can lead to high-frequency oscillations which 
can adversely affect robustness of an adaptive control law. A new adaptive control modification is presented 
that can achieve robust adaptation with a large adaptive gain without incurring high-frequency oscillations as 
with the standard model-reference adaptive control. The modification is based on the minimization of the 
norm of the tracking error, which is formulated as an optimal control problem. The optimality condition is 
used to derive the modification using the gradient method. The optimal control modification results in a stable 
adaptation and allows a large adaptive gain to be used for better tracking while providing sufficient stability 
robustness. Simulations were conducted for a damaged generic transport aircraft with both standard adaptive 
control and the adaptive optimal control modification technique. The results demonstrate the effectiveness of 
the proposed modification in tracking a reference model while maintaining a sufficient time delay margin. 


I. Introduction 

Adaptive control is a potentially promising technology that can improve performance and stability of a conven- 
tional fixed-gain controller. In recent years, adaptive control has been receiving a significant amount of attention. In 
aerospace applications, adaptive control has been demonstrated in a number of flight vehicles. For example, NASA 
has recently conducted a flight test of a neural net intelligent flight control system on board a modified F-15 test 
aircraft. 1 The U.S. Air Force and Boeing have successfully developed and completed numerous flight tests of direct 
adaptive control on Joint Direct Attack Munitions (JDAM). 2 The ability to accommodate system uncertainties and to 
improve fault tolerance of a control system is a major selling point of adaptive control since traditional gain-scheduling 
or fixed-gain control methods are viewed as being less capable of handling off-nominal operating conditions outside 
of a normal operating envelope. Nonetheless, these traditional control methods tend to be robust to disturbances and 
unmodeled dynamics when operated as intended. 

In spite of the advances made in the field of adaptive control, there are several challenges related to the imple- 
mentation of adaptive control technology in aerospace systems. These challenges include but are not limited to: 1) 
robustness in the presence of unmodeled dynamics 3 and exogenous disturbances; 2) stability metrics of adaptive con- 
trol as related to adaptive gain and input signals; 3) adaptation in the presence of actuator dynamic constraints; 4) 
on-line reconfiguration and control reallocation using non-traditional control effectors; and 5) time-scale separation in 
actuator systems with different time latency. 

The absence of the verification and validation methods of adaptive control systems remain a major hurdle to the 
implementation of adaptive control in safety-critical systems. 4,5 This hurdle can be traced to the lack of performance 
and stability metrics for adaptive control which poses a major challenge that prevents adaptive control from being 
implemented in safety critical systems. The development of verifiable metrics for adaptive control will be important in 
order to mature adaptive control technology for use in operational safety-critical systems. Of these, stability metrics of 
adaptive control are an important consideration for assessing system robustness to unmodeled dynamics and exogenous 
disturbances. In one aspect of verification and validation, a control system is usually certified by demonstrating that it 
meets an acceptable set of requirements or specifications for stability margins, among other things. Herein lies a major 
challenge for verification and validation as there is no existing standard tool for stability margin analysis of nonlinear 
adaptive control. The lack of stability metrics for adaptive control is viewed as a technology barrier to developing 
certifiable adaptive control for safety-critical systems. 4,5 
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Over the past several years, various model-reference adaptive control (MRAC) methods have been investigated. 6-15 
The majority of MRAC methods may be classified as direct, indirect, or a combination thereof. Indirect adaptive 
control methods are based on identification of unknown plant parameters and certainty-equivalence control schemes 
derived from the parameter estimates which are assumed to be their true values. 16 Parameter identification techniques 
such as recursive least-squares and neural networks have been used in indirect adaptive control methods. 7 In con- 
trast, direct adaptive control methods directly adjust control parameters to account for system uncertainties without 
identifying unknown plant parameters explicitly. MRAC methods based on neural networks have been a topic of great 
research interest. 8-10 Feedforward neural networks are capable of approximating a generic class of nonlinear functions 
on a compact domain within arbitrary tolerance, 17 thus making them suitable for adaptive control applications. In par- 
ticular, Rysdyk and Calise described a neural net direct adaptive control method for improving tracking performance 
based on a model inversion control architecture. 8 This method is the basis for the intelligent flight control system that 
has been developed for the F-15 test aircraft by NASA. Johnson et al. introduced a pseudo-control hedging approach 
for dealing with control input characteristics such as actuator saturation, rate limit, and linear input dynamics. 10 Ho- 
vakimyan et al. developed an output feedback adaptive control to address issues with parametric uncertainties and 
unmodeled dynamics. 12 Cao and Hovakimyan developed an j£?i adaptive control method to address high-gain con- 
trol. 13 Nguyen et al. developed a hybrid direct-indirect adaptive control to also deal with high-gain control issues 
during adaptation. 

While adaptive control has been used with success in a number of applications, the possibility of high-gain control 
due to fast adaptation can be an issue. In certain applications, fast adaptation is needed in order to improve tracking 
performance when a system is subject to a large source of uncertainties such as structural damage to an aircraft that 
could cause large changes in system dynamics. In these situations, a large adaptive gains can be used in the adaptation 
in order to reduce the tracking error rapidly. However, there typically exists a balance between stability and adaptation. 
It is well known that high-gain control or fast adaptation can result in high-frequency oscillations which can excite 
unmodeled dynamics that could adversely affect the stability of an MRAC law. 16 Recognizing this, some recent 
adaptive control methods have begun to address high-gain control, such as the adaptive control 13 and the hybrid 
direct-indirect adaptive control. 14 In the former approach, the use of a low-pass filter effectively prevents any high 
frequency oscillation that may occur due to fast adaptation. In so doing, the reference model is no longer preserved and 
instead must be reconstructed using a predictor model. In the latter approach, direct and indirect adaptive control are 
blended together within the same control architecture. The indirect adaptive law is based on a recursive least-squares 
parameter estimation that is used to adjust parameters of a nominal controller to reduce the modeling error, and any 
remaining tracking error signal could then be handled by a direct adaptive law with a smaller adaptive gain. 

Various modifications were developed to increase robustness of MRAC by adding damping to the adaptive law. 
Two well-known modifications in adaptive control are the cr -modification 1 6 and £\- modification. 18 These modifica- 
tions have been used extensively in adaptive control. This paper introduces a new adaptive law based on an optimal 
control formulation to minimize the norm of the tracking error. The optimality condition results in a damping 
term proportional to the persistent excitation. The analysis shows that the adaptive optimal control modification can 
allow fast adaptation with a large adaptive gain without causing high-frequency oscillations and can provide improved 
stability robustness while preserving the tracking performance. 

This paper is organized in two major sections. The first section is devoted entirely to the theoretical development 
of the new adaptive optimal control law which includes a Lyapunov stability proof and a derivation of an upper bound 
of the tracking error. The second section demonstrates a practical implementation of the optimal control modification 
in a neural net adaptive flight control architecture for a damaged generic transport aircraft. Simulation results are 
presented to demonstrate potential benefits of the new adaptive control law. 
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II. Theory of Optimal Control Modification 


— ► Reference Model 


( )- 


Confroiler 


Controller Gain 


Fig. 1 - Direct MRAC 


A direct MRAC problem is posed as follows: 

Given a nonlinear plant as 

x = Ax + B[u+f(x)\ (1) 

where x{t ) : [0,oo) — > R n is a state vector, u(t) : [0,oo) — > W is a control vector, A e R nxn and B e R nxp are known 
such that the pair (A,B) is controllable, and / (x) : R n — » W is a bounded unstructured uncertainty. 

Assumption 1: The uncertainty / (x) can be approximated using a feedforward neural network in the form 

/(*) = Z (*) + e ( x ) = W + £ 0) (2) 

i=l 


where 0* e is an unknown constant weight matrix that represents a parametric uncertainty, ® (x) : — ► R m is 

a vector of known bounded basis functions with Lipschitz nonlinearity, and e (x) : — ► M. p is an approximation error. 

Since O (x) is Lipschitz, then 

||<D(x)-O(x 0 )|| <C||x — x 0 || (3) 

for some constant C > 0, which implies a bounded partial derivative 


dx 


( 4 ) 


for some constant L > 0. 

The set of basis functions O (x) is chosen such that the approximation error e (x) becomes small on a compact 
domain x £ W 1 . The universal approximation theorem for sigmoidal neural networks by Cybenko can be used for 
selecting a good set of basis functions O (x). 17 Alternatively, the Micchelli’s theorem provides theoretical basis for a 
neural net design of 0 T ® (x) using radial basis functions to keep the approximation error e (x) small. 19 

Assumption 2: The set ofbasis functions O (x) satisfies the persistent excitation (PE) condition for some ao, ai , T 0 > 


0 


1 r^+^o 

a \ />— / O(x(t))0) t (x(t))^t > a 0 /, V*>0 (5) 

To J t 


where / is an identity matrix. 

The objective is to design a controller that enables the plant to follow a reference model 


%m — A m x m T" B m r (6) 

where A m G M nxn is Hurwitz and known, B m e R nxp is also known, and r (t) : [0,oo) — ► W e JSfoo is a command vector 
with r e «5foo. 

Defining the tracking error as e = x m — x, then the controller u ( t ) is specified by 

u = K x x + K r r - u ad (7) 
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where K x G R pxn and K r G R pxp are known nominal gain matrices, and u ad G is a direct adaptive signal. 
Then, the tracking error equation becomes 


e = x m -x = A m e + {A m -A - BK x )x-\- (B m - BK r ) r + B u ad - 0* T O (x) - e (x) 


( 8 ) 


We choose the gain matrices K x and K r to satisfy the model matching conditions A +BK X = A m and BK r — B m . 
The adaptive signal u ad is an estimator of the parametric uncertainty in the plant such that 


« ad = 0) 


( 9 ) 


where 0 e R mxp is an estimate of the parametric uncertainty 0*. 

Let 0 = 0 — 0* be an estimation error of the parametric uncertainty. Then the tracking error equation can be 
expressed as 


e = A m e + B 


(eTo-e) 


Proposition 1: The following adaptive law provides an update law that minimizes \\e\\^ 2 

0 = -ro (e T P- v<& t ®B t PA-^ b 


( 10 ) 


( 11 ) 


where T = T t > 0 G is an adaptive gain matrix, v > 0 G R is a weighting constant, and P = P T > 0 G l nx ” 
solves 

PA m +AlP = -Q (12) 

where Q = Q r >0 £ R" x ". 

Proof: The adaptive law seeks to minimize the cost function 


J=\j f {e-A) T Q{e-A)dt 


(13) 


subject to Eq. (10) where A represents the tracking error at t = tf. 

J is convex and represents the distance measured from the normal surface of a ball B r with a radius A. 



Fig. 2 - Tracking Error Bound 

This optimal control problem can be formulated by the Pontryagin’s Minimum Principle. Defining a Hamiltonian 
H(e,@) = ^{e-A) T Q{e-A)+p T (A m e + B® T ®-Be^) (14) 

where p (f) : [0,oo) — > W 1 is an adjoint variable, then the necessary condition gives 

p=-VHj =-Q(e-A)-Alp (15) 

with the transversality condition p (tf) = 0 since e (0) is known. The optimality condition is obtained by 

Vtf0T=OVtf@T O = ®/> T B (16) 
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The adaptive law is formulated by the gradient method as 


& = -rvtf@ T = -rop T s (17) 

The solution of p can be obtained using a “sweeping” method 20 by letting p = Pe + S® <J> . Then 


Pe+p(A m e + 50 t O - 50 * T ® - Be ) + 5© t O + = —Q (e — A)—A^ (fe + S® T 


(18) 


which yields the following equations 


P+PA m +AlP+Q = 0 
5 + P 5 + ^=0 

QA - S - ^ + PB (9* t O + g) = 0 


(19) 

( 20 ) 

(21) 


subject to the transversality conditions P ( tf ) = 0 and S ( tf ) = 0. 

The existence and uniqueness of the solution of the Lyapunov differential equation (19) is well-established. It 
follows that Eq. (20) also has a stable, unique solution in time-to-go x = tf — t. 

Since r G Jzfoo, is bounded and Lipschitz, and p (tf) = 0 from the transversality condition, then as tf — ► oo, 
lim^oo | d (0 T O) fdt\ exists, where 


lim 

d (0 T <J>) 

= lim 

tf — >oo 

dt 

tf — >00 


-b t p (tf) o T ro + ® T ~ \± m - b (© T ® - £ )] 


< lim 

tf—> OO 


® t LJ? \-A~'B m r-B (© T ® -g)] I = a, (22) 


for some constant vector o t > 0 G and ^ G R mxn is a matrix whose elements are all equal to one. 

Consider an infinite time-horizon problem as tf — ► oo, then P(t) — ► P (0) and S(t) — ► S (0). The constant solutions 
of P and S are determined by their steady state values from Eqs. (19) and (20) that give 


PA m ~f~ A m P — Q 


(23) 


S= —A~ T PB 

The adjoint p is then obtained as 

p = Pe — vA~ T PBQ r O 


(24) 

(25) 


where v is introduced as a weighting constant to allow for adjustments of the second term in the adaptive law. Since 
©* is constant, then the adaptive law (1 1) is obtained from Eqs. (75) and (25). 

Defining 5 e = sup, \e | and <p = sup, |0* T cp | , then for v = 1 the unknown tracking error AaU = fy— >-oois bounded 


by 


l|A|| < 


ug 

Kin (Q) 


Ml +11* 11+ 



(26) 


where X and cr denote the eigenvalue and singular value, respectively. 


Theorem 1: The adaptive law (11) results in stable and uniformly bounded tracking error outside a compact set 

r \ 

, 2 


w 


= \ e,0 T O eR n :Kin(Q) [(||e|| - Aj) 2 -A 2 ] +vKin (b T A~ t QA~ 1 b) (||© T ®|| -A 2 ) -A: 


< 0 } (27) 


where 


_ Kaxjf) || 5 || ||*|| 

Kin (2) 

a^EpPA-'B) jjgj| 

Kin (FA^ T QA^B) 


(28) 

( 29 ) 
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Proof: Choose a Lyapunov candidate function 

V = e T Pe + trace (® T T-'®^j (30) 

Evaluating V yields 

V = e T (A m P+PA m )e + 2e T PB(® T ®-e^ -2trace 0 T O (e T PB - v<t> T ®B T PA m ] B^ (31) 

Using the trace identity trace (A t B) =BA t ,V can be written as 

V = -e T Qe + 2 e T PB (© T ® - e) - 2 e T PB® T <$ +2v<D T ®B T PA m ] B® r <& (32) 

The sign-definiteness of the term PA~ l is now considered. We recall that a general real matrix G is positive 
(negative) definite if and only if its symmetric part G = \ (G + G T ) is also positive (negative) definite. Then, by pre- 
and post-multiplication of Eq. (12) by A~ r and A~\ respectively, PA~ l can be decomposed into a symmetric part M 
and anti-symmetric part N as 

PA~ l = M+N (33) 

where 

M=\(A- m v P + PA-') = - X -A- m v QA-J (34) 

N=\{pA?-A?p). (35) 

Since the symmetric part M < 0, we conclude that PA~ l <0. Then, V becomes 
V = -e T Qe-2e T PB£ - v® T GB T A m T QA m l BG T <3> + 2vO T 05 T 7V r 50 T O + 2v® T G*B T PA~ ] BG T ® (36) 

Letting y = Z?0 T O and using the property y T Ny = 0 for an anti-symmetric matrix N, V is reduced to 

V = -e T Qe-2e T PBe - v® T ®B T A m T QA m } Be T <!>+2v® T G*B T PA m 'BG T ® (37) 

which is bounded by 

v < -Kin{Q) ||e|| 2 + 2|| e ||A max (/ J ) ||5|| ||5 £ || - vk min (b t A~ t QA~ m x ||© T o|| 2 

+ 2 vo max ||©* T 0»|| ||© T o|| (38) 

By completing the squares, an upper bound of V is obtained as 

V<-Kin{Q) [(||e||-A 1 ) 2 -Af] - vl min (B T A- T QA-'B ) [(||© t O || - A 2 )' - A 2 (39) 

If W is a compact set defined in Eq. (27), then the time rate of change of the Lyapunov candidate function is 
strictly semi-negative in the complementary set W c and positive in the compact set W. Thus, the Lyapunov candidate 
function V decreases everywhere in the complementary set W c , but increases in the compact set W which contains 
the origin at e = 0 and 0 = 0. Any trajectory of e and 0 starting in W will remain in W for all t. Therefore, the 
compact set W is an invariant set. 16 Then, any trajectory e and 0 starting in the complementary set W c will approach 
the largest invariant set W as t — ► oo. 21 It follows by the LaSalle’s Invariance Principle that e and 0 are ultimately 
uniformly bounded. Thus, the adaptive optimal control modification is stable. 


Remark: The effect of the optimal control modification is to add damping to the weight update law so as to reduce 
high-frequency oscillations in the weights. The damping term depends on the persistent excitation (PE) condition. 
With persistent excitation, the weight 0 is exponentially stable and bounded. This scheme is contrasted to the well- 
known cr- 16 and £i- 18 modification methods and other variances which also add damping terms to prevent parameter 
drift in the absence of the persistent excitation. 18 These adaptive laws are compared as follows: 
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Modification 

Adaptive Law 

<j- 

0 = -r (®e T PB + <J0), <j > 0 

£l- 

0 = -r (Oe T PB + pi e T PB ®),pi> 0 

Optimal 

0 = -r (®e T P5 - v®®^^^- 1 ^), v > 0 


Table 1 - Modifications to MRAC Law 


Theorem 2: In the presence of fast adaptation, i.e., (r) 1, the adaptive law (1 1) is robustly stable for v = 1 

with all closed-loop poles having negative real values. 

Proof: The adaptive law (11) can be written as 


If r » 1 is large and the input is PE, then in the limit as O r rO — ► oo 

50 T ® -► - p-'AlPe 

v 1 

Hence, the closed-loop tracking error equation becomes 


(40) 


(41) 


e = -P 


,-i 


1+v 

2v 


Q- 


1 — V 

2v 


!-S^0* T O+e) 


(42) 


where S = A m P — PA m . 

For v = 1, the closed-loop poles are all real, negative values with Re[s] = —X {P~ l (f). The system transfer 
function matrix H(s) = (sI + P~ l Q) 1 is strictly positive real (SPR) since H(jco) +H T (-jco) > 0, and thus the 
system is minimum phase and dissipative. 22 The Nyquist plot of a strictly stable transfer function for a SISO system 
is strictly in the right half plane with a phase shift less than or equal to f 22 corresponding to a phase margin of at least 
| . For a MIMO system, the diagonal elements of the system transfer function matrix exhibit a similar behavior. 


Lemma 1: The equilibrium state y = 0 of the differential equation 

y=-0 T (t)T0 ( t)y 


(43) 


where y (f ) : [0, oo) — > R, O (f ) e J *?2 * [0, oo) — > M" is a piecewise continuous and bounded function, and T > 0 e 
is uniformly asymptotically stable, if there exists a constant y > 0 such that 


1 r*+ T o 

— J (z)r<S>(T)dT>y 

which implies that y is locally bounded by the solution of a linear differential equation 

z = —yz 

for t £ [ti,ti + To], where U = U-\ + 7o and / = 1,2, oo. 

Proof: Choose a Lyapunov candidate function and evaluate its time derivative 

1 9 

v = -® T (^)r® (0/ = — 2 ® T (or® (0 v 

Then, there exists y > 0 for which V is uniformly asymptotically stable since 

v (t + To) = V (0 exp ^-2 J + ° ® T (t) r® (t) c?t) < V ( t ) e 


-2 yT 0 


(44) 


(45) 


(46) 

(47) 

(48) 
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This implies that 


exp 2jf O t (t)FO(t)*/t^ <e lyT ° 


Thus, the equilibrium y = 0 is uniformly asymptotically stable if 

\ ft+ T o 


— j <P T (T)r(I)(T)c?T > 7 


(49) 


(50) 


provided ® (f) e 2 is bounded. 
Theny(f) G JSfoo since 


/•oo /*oo 

V( t ->x)-V(0) < — 2y I y 2 (t) dt =>2y / / (f)<* < F(0) - V(t-> oo) < oo (51) 

JO Jo 


It follows that 

F< -2yV^yy< -yy 1 (52) 

which implies that the solution of Eq. (43) is bounded from above ify > 0 and from below ify < 0 by the local solution 
of 

z=—yz (53) 

for t £ [t(,ti + Jo]> where to = 0, U = ti-\ + To, and i = 1,2, . . . ,n — ► oo. 

Now, suppose that ® ® (y), Eq. (53) still applies. The condition O (y (*)) £ J*?2 is identically satisfied since 

y £ =£2 H JSfoo. To show this, we first evaluate F as 

V = -O t (y) ro (y)y 2 = -20 T (y) TO (y) F (54) 


Expressing F (f ) as F (y) yields 


.dV 


dV 


V=y- = -®' (y)rO(y)y- = - 20 1 (y)T<D(y)F 






(55) 


Thus 


Suppose we can find y such that 


dV =2 dy 
V y 


dy 

y 


< —ydt 


(56) 

(57) 


Then multiplying both sides of Eq. (57) by y 2 and dividing by dt result in the same equation as Eq. (52). Thus, F 
is uniformly asymptotically stable andy is bounded by the same equation as Eq. (53). Therefore, y given by Eq. (44) 
satisfies Eq. (57). 

Lemma 1 is a version of the well-known Comparison Lemma. 21 A different version of the proof is also provided 
by Nadrenda and Annaswamy. 23 


Lemma 2: The solution of a linear differential equation 

y = Ay+g(t) (58) 

where y(t) : [0,oo) — ► W 1 , A £ R nxn is a Hurwitz matrix, and g(f) : [0,oo) — > W 1 £ is a piecewise continuous, 
bounded function, is asymptotically stable and semi-globally bounded from above by the solution of a differential 
equation 

z = A (z — a |^4 _1 c|) (59) 

where a > 1 £ R and c = sup, |g (01- 

Proof: For matching initial conditions y (0) =z( 0), the solutions ofy and z are 

y=e At y( 0)+ [ (60) 

Jo 
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e 4 ^ T ^aA\A x c\dx 


If A 1 c>0, then 


y = z-\- f e 4 ^ x ^acdz-\- [ e 4 ^ x ^g(z)dz =z-\- f e 4 ^ X ^A [aA X c + A x g{z)\dz (62) 

Jo Jo Jo 

a > 1 can be made large enough for aA~ x c + A~ x g( z) > 0 because A~ x c > 0 and g is bounded, and since 
jy^Adz < 0, then 


f e 4 (* X ^A [aA x c-\-A x g{z)\dz <0 
Jo 


Therefore, y<z. 
If A~ x c < 0, then 


y = z— f e 4 ^ x ^acdz-\- f e 4 ^ x ^g(z)dz =z— f e 4 ^ X ^A [aA X c — A x g{z)\dz (64) 

Jo Jo Jo 

a can be made large enough for aA~ x c — A~ x g(z) < 0 because A~ x c < 0 and g is bounded, therefore y <z. Thus, 
y <zforalHe [0,oo) and some a > 1. 


Theorem 3: The steady state tracking error is bounded by 

limsup||e||= . [v ||<p|| + v ||^ m || H^" 1 1| ||g £ || + ^ (V A~ T PB) ' ||/3||1 (65) 

t ~ KO t o m in{A^P+vPA m )\. y \ ) J 

if there exists a constant y > 0 such that y = inf, ^ jr J t ‘ +T ° 0 T r0c?r j >0 g 1 and a constant vector ft >0Gl n where 
P = sup, |0 T 6|. 

Proof: Since 0 is bounded by the adaptive law (11) and lim^^ |^(0 T O) jdt\ < o> exists which implies d (0 T O) jdt 
is bounded, then p = sup, |0 T 6| G Jifoo is bounded. 

Since e G J*? 2 , x G J*? 2 , and so O (jc) G J*? 2 , then using Lemma 1 , the adaptive law (11) can be written as 

J t (0 T ®) = (=) T ® + 0 T 6 < —yB T Pe + yvB T A~ T PB ^0 T O - <p - | (yvB T A~ T PB^ ' P ^ (66) 

for t G [UJi + 7o]> where tj = t%- \ + T 0 and i = 1,2, . . . ,n — ► oo. 

Using Lemma 2 with a = 1 for simplicity, we write 

e<A m {e-\A-'B5 s \)+B® T Q> (67) 

Differentiating Eq. (67) and upon substitution yields 

e - (A m + yvBB T A~ T pj e + (yBB J P+ yvBB T A~ T PA m S j e < -yvBB T A~ T PB <p + I (yvB 1 A~ T PB^ p 


+ yvBB T A~ T PA m \A~ l B8 s I (68) 


whose steady state solution is 


(A T m P+ vPA m ) Hm e< — vPB q,+ (yvB T A~ T PB^ ' p +vPA m \A~ 1 BS l 


which leads to Eq. (65). 

The steady state upper bound on the norm of 0 T O is also obtained as 


y ^max (^) IMm || 

lim SUp 0 O = 7 " ~ 

t a min {Al l P-\-vPA m ) 


v ll < p|l + IMmll IMm'll 11^11 + “ (f A~^ Pb)~' ||0 ||] (70) 


American Institute of Aeronautics and Astronautics 



Thus for y — ► oo, the last term on the RHS of Eq. (65) goes to zero, and \\e\\ is only dependent on v. If, in addition, 
v — ► 0, then \\e\\ — ► 0, but if v — ► oo, e e JSfoo is finite and does not tend to zero. Thus, v has to be selected small 
enough to provide a desired tracking performance, but large enough to provide sufficient robustness against time delay 
or unmodeled dynamics. With v = 1 as the optimal value, further increase the value of v beyond the optimal value will 
actually reduce robustness as well as tracking performance. Thus, a practical bound for v is 0 < v < 1 . Both Theorems 
2 and 3 provide a guidance in a trade-off design process for selecting a suitable value of v to meet performance and 
robustness requirements. 


III. Application to Neural Net Adaptive Flight Control 



Fig. 3 - Direct Neural Network Adaptive Flight Control 


Consider the following inner loop adaptive flight control architecture as shown in Fig. 3. The control architecture 
comprises: 1 ) a reference model that translates rate commands into desired acceleration commands, 2 ) a proportional- 
integral (PI) feedback control for rate stabilization and tracking, 3) a dynamic inversion controller that computes 
actuator commands using desired acceleration commands, and 4) a neural net direct MRAC with the optimal control 
modification. 

Damage adaptive flight control can be used to provide consistent handling qualities and restore stability of aircraft 
under off-nominal flight conditions such as those due to failures or damage. The linearized equations of motion are 
expressed as 

x = A\\x-\-A\2Z-\-B\u-\- f\ (x,z) (71) 

z = A 2 \x+ A 2 2 z + B 2 u + f 2 (x,z) (72) 


where Ay and B u i = 1 , 2 , j = 1,2 are known, x = 


-iT 


p q r 


is a vector of roll, pitch, and yaw rates; z = 


A 0 Aa Aj3 AV Ah A 6 


angle A fi , airspeed AF, altitude Ah , and pitch angle A 6; u 


is a vector of perturbation in the bank angle A 0, angle of attack Aa, sideslip 

-iT 


A5 a A 8 e A5 r 


is a vector of additional aileron, 


elevator, and rudder deflections; and ft (x,z), i= 1,2 is an uncertainty due to damage which can be approximated as 


fi{x,z) = ®]Q>{x,z)+£ 


(73) 


0 (jt,z) is a basis function for a sigma-pi neural network with Q, i = 1, ,4, as inputs consisting of control 

commands, sensor feedback, and bias terms; defined as follows: 


Cx = \p (h)V 2 


x T ax r fix 1 


Ci = ^p (h)v 2 


1 <p 6 a ft a 2 ft 2 aft 


C 3 = -p {h)V 2 u(x,z) T au(x,z) T Pu(x,z ) 1 


C 4 = -p (h)V 2 


px qx rx 


(74) 

(75) 

(76) 

(77) 
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where </>=</> + A 0, a = a + Aa, P = /3 + A/3, F = F + AF, h = h + A/z, and 0=0 + A0; and the overbar symbol 
denotes a trim state. 

These inputs are designed to model the unknown nonlinearity that exists in the damaged aircraft plant dynamics. 
For example, the aerodynamic force in the x-axis for an aircraft is given by 


pb 


qc 


rb 


F x - T +-qS\C LQ -\-C La a-\-C Lp P +C Lp — +C Lq — +C Lr — +C Lsa 5 a -hC Lde 8 e -i-C Lsr 5 r I a 


- \qS (c Do +C Da a+C Dp ^+C Dq ^+C Dr ^+C Dsa 5 a + C Dse 5 e + C Dsr 5 r j (78) 


? 2F 


r 2V 


Thus, Ci , C2, and C3 are designed to model the product terms of x, z, and u in the aerodynamic forces and moments 
equations; and C4 models the gyroscopic cross-coupling terms of x in the moment equations. 

The inner loop rate feedback control is designed to improve aircraft rate response characteristics such as the short 
period mode and the dutch roll mode. A second-order reference model is specified to provide desired handling qualities 
with good damping and natural frequency characteristics as follows: 


+2 t^pCQpS + COp) <j) m — gpblat 

+ 2£q(DqS + (Oq ) 6 m = gq8i on 

(. s 2 + 2 C,r(O r S + c D r 2 ) r m = grSrud 


(79) 

(80) 
(81) 


where <j) m9 0 m , and y/ m are reference bank, pitch, and heading angles; (o p , (o q , and (o r are the natural frequencies for 
desired handling qualities in the roll, pitch, and yaw axes; Q p , Q q9 and C, r are the desired damping ratios; Si at9 8i on , and 
8rud are the lateral stick input, longitudinal stick input, and rudder pedal input; and g p9 g q9 and g r are input gains. 

Let p m = <i>m,qm = dm , and r m = i j/ m be the reference roll, pitch, and yaw rates. Then the reference model can be 
represented as 


r*. 

Jo 


K j}%m F i I x m dr “I - Gy 


(82) 


where x m 


, K p = diag (2£ p a> p , 2 C,to r ), K t = diag {a 2 p , co 2 , a 2 ) , G = diag (g p ,g q ,g r ), and 


Pm q m r m 

1 T 

$lat &lon &rud 

Assuming the pair (A\\,B\) is controllable and z is stabilizable, an angular rate feedback dynamic inversion con- 
troller is computed as 


u c = B 


-l 


K- 

K p - 1 — - ] x-\-Gr — A\\x — A\iz — ©j 1 O 


(83) 


where u c is a control surface deflection command vector as an input to a flight control actuator system which can be 
modeled as a first-order system 

u = —A (w — u c ) (84) 

where A = diag (A a , k e , k r ) > 0 is a vector of actuator rates for aileron, elevator, and rudder. 

When the actuator dynamics are fast relative to the reference model, i.e., X m i n (A) >> <j m in (^ w ), then the actuator 
output follows the input closely so that u ~ u c . 

Let e= ^ /o (x m — x) dr x m — x be the tracking error, then the tracking error equation is given by 


e — A m e + B 




where 


Am = 


B- 


-Kt -K p 

0 

I 


(85) 

(86) 

(87) 
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Let Q = 27, then the solution of Eq. (12) yields 


P = 


Kr'Kp+K-'iKi+I) K7 1 

K7 l K~ l (I+K7') 


>0 


A m 1 is computed to be 


v = 


-K7% -K7' 

I 0 


Evaluating the term B T PA m [ B yields 

B T PA m l B = -Kr 2 < 0 

Applying the adaptive optimal control modification (11), the weight update law is then given by 


0! = -ro 



( 88 ) 


(89) 

(90) 

(91) 


IV. Simulation Results 

To evaluate the adaptive optimal control modification, a simulation was conducted using a generic transport model 
(GTM) which represents a notational twin-engine transport aircraft as shown in Fig. 4. 24 An aerodynamic model of 
the damaged aircraft is created using a vortex lattice method to estimate aerodynamic coefficients, and stability and 
control derivatives. For the simulation, a damage configuration is modeled corresponding to a 28% loss of the left 
wing. The damage causes an estimated C.G. shift mostly along the pitch axis with Ay = 0.0388c and an estimated 
mass loss of 1 .2%. The principal moment of inertia about the roll axis is reduced by 12%, while changes in the inertia 
values in the other two axes are not as significant. Since the damaged aircraft is asymmetric, the inertia tensor has 
all six non-zero elements. This means that all the three roll, pitch, and yaw axes are coupled together throughout the 
flight regime. 



Fig. 4 - Left Wing Damaged Generic Transport Model 

A level flight condition of Mach 0.6 at 15,000 ft_is selected. Upon damage, the aircraft is re-trimmed with T = 
13,951 lb, a = 5.86°, 0 = —3.16°, 8 a = 27.32°, 5 e = —0.53°, 8 r = —1.26°. The remaining right aileron is the 
only roll control effector available. In practice, some aircraft can control a roll motion with spoilers, which are not 
modeled in this study. The reference model is specified by co p = 2.0 rad/sec, (o q = 1.5 rad/sec, co r = 1.0 rad/sec, and 
Cp = C 9 = Cr=l/v^. 
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The state space model of the damaged aircraft is given by 


P 

q 

r 


-1.3623 -0.2649 0.5470 

-0.0659 -0.8949 0.0151 

0.0835 -0.00417 -0.5130 


P 

q 

r 


+ 


0 

-0.0007 

0 


-10.9768 

-2.7042 

0.1845 


-9.2449 -0.6380 0.0004 0 

-0.0120 -0.1409 -0.0004 0 
2.8755 -0.0149 0 0 


A0 

Aa 

A/3 

AV/V 

Ah/h 

Ad 



3.2225 -0.0449 1.4887 


" AS* ' 

+ 

0.3393 -3.4650 0.0258 


A8 e 


-0.0123 0.0007 -2.2949 


1 

< 

1 


(92) 


A <j) 


1 

Ad 


-0.0059 

A/3 


0.0978 

AV/V 


0.0001 

Ah/h 


0 

A0 


0 


0 

0.0028 

0.0507 

0 

0 

0 


0 

0.1024 

0.9722 

0.0004 

-0.0001 

-0.9849 

-0.0014 

-0.0006 

0 

0 

1 

0.0551 

0 

0 

-0.4802 

0.0238 

0.0016 

-0.1751 

-0.0453 

0.0044 

-0.0423 

0 

0 

0 


P 

q 

r 


0 

-0.0648 

-0.1850 

-0.0082 

0 

0 


+ 


0 

0 


A (j) 

0.0304 

0 


Aa 

0.0028 

-0.0017 


A/3 

0 

-0.0507 


AV/V 

0 

0.0423 


Ah/h 

0 

0 


A0 

0 

0 

0 

0.0241 

-0.0704 

-0.0012 

0.0053 

0 

0.0604 

0.0018 

-0.0048 

-0.0028 

0 

0 

0 

0 

0 

0 


AS* 

A8 e 

A8r 


(93) 


The actuator dynamics are modeled with k a = k e = A r = 50 /sec with position limits of ±35° for the aileron and 
elevator and ±10° for the rudder. 

The pilot pitch rate command is simulated with a series of ramp input longitudinal stick command doublets, 
corresponding to the reference pitch angle ±3.81 0 from trim. The tracking performance of the baseline flight control 
with no adaptation is compared against the neural net direct MRAC, the E\ modification with fi = 0. 1 , and the present 
adaptive optimal control modification with v = 0. 1 . The adaptive gains are selected as large as possible within the 
numerical stability limit of the adaptive laws. This results in T = 60 for direct MRAC, T = 800 for the E\ modification, 
and T = 2580 for the adaptive optimal control modification. Thus, it can be seen that the optimal control modification 
can tolerate a much larger adaptive gain than the standard MRAC. This large adaptive gain allows the adaptive optimal 
control modification to better adapt to uncertainties than the standard direct MRAC. 
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Fig. 5 - Pitch Rate 


The aircraft angular rate responses are shown in Figs. 5 to 8. Figure 5 illustrates the pitch rate response due to the 
four controllers. With no adaptation, the baseline flight control system cannot follow the reference pitch rate very well 
as there is a significant overshoot. Both the direct MRAC and the £\ modification improves the tracking by about the 
same amount. However, the adaptive optimal control modification appears to provide better tracking than the MRAC 
and the £\ modification. 

Since the damage occurs to one of the wings, the roll axis is most affected. With no adaptation, there is a significant 
roll rate as high as 20°/sec as shown in Fig. 6. Both the direct MRAC and £\ modification reduce to maximum 
amplitude of the roll rate to about 10°/sec. The adaptive optimal control modification further reduces the roll rate to a 
maximum value of about 4°/sec. 




Fig. 6 - Roll Rate 

Figure 7 is the yaw rate response of the damaged aircraft. All the three adaptive controllers significantly reduce the 
roll rate to a reasonably low level. The £\ modification performs slightly better than the MRAC and adaptive optimal 
control modification. 
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Fig. 7 - Yaw Rate 


Figure 8 is the plot of the tracking error j£? 2 norm for all the three aircraft angular rates that compares the overall 
performance of the four controllers collectively in the three axes. When there is no adaptation, the tracking error norm 
appears to grow considerably in the first 10 sec. The £\ modification actually results in higher tracking error than 
the direct MRAC. This could be explained by the fact that the £\ modification trades performance for robustness, so 
the tracking performance is expected to be worse. The adaptive optimal control modification results in the smallest 
tracking error norm as compared to the MRAC and £\ modification. Thus, overall, the adaptive optimal control 
modification performs significantly better than the MRAC and £\ modification. 




Fig. 8 - Tracking Error Norm 

The attitude responses of the damaged aircraft are shown in Figs. 9 to 12. When there is no adaptation, the pitch 
attitude could not be followed accurately as seen in Fig. 9. With adaptation on, the tracking is much improved and the 
adaptive optimal control modification follows the pitch command better than the direct MRAC and the £\ modification. 
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t, sec 


Fig. 9 - Pitch Angle 

Figure 10 is the plot of the bank angle. Without adaptation, the damaged aircraft exhibits a rather severe roll 
behavior with the bank angle ranging from —30° to 20°. Both the direct MRAC and £\ modification improve the 
situation significantly. The adaptive optimal control shows a drastic improvement in the arrest of the roll motion with 
the bank angle maintained closed to the trim value. 




Fig. 10 - Bank Angle 

Figure 1 1 is a plot of the angle of attack. All the adaptive controllers produce similar angle of attack responses. 
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Fig. 1 1 - Angle of Attack 


Figure 12 shows a plot of the sideslip angle. In general, flying with sideslip angle is not a common practice 
since a large sideslip angle can cause an increase in drag and more importantly a decrease in the yaw damping. With 
the adaptive optimal control modification, the sideslip angle is reduced to near zero, while the direct MRAC and £\ 
modification still show some sideslip angle responses. 





Fig. 12 - Sideslip Angle 

The control surface deflections are plotted in Figs. 1 0 to 1 2. Because of the wing damage, the damaged aircraft has 
to be trimmed with a rather large aileron deflection. This causes the roll control authority to severely decrease. Any 
pitch maneuver can potentially run into a control saturation in the roll axis due to the pitch-roll coupling that exists in 
a wing damage scenario. With the maximum aileron deflection at 35°, it can be seen clearly from Fig. 13 that a roll 
control saturation is present in all cases. The range of aileron deflection when there is no adaptation is quite large. As 
the aileron deflection hits the maximum position limit, it tends to over-compensate in the down swing because of the 
large pitch rate error produced by the control saturation. 
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Fig. 13 - Aileron Deflection 

Figure 14 is a plot of the elevator deflection which is shown to be within a range of few degrees for all the four 
controllers and well within the control authority of the elevator. This implies that the roll control contributes mostly to 
the overall response of the wing-damaged aircraft. 






Fig. 14 - Elevator Deflection 

The rudder deflection is shown in Fig. 15. With no adaptation, the rudder deflection is quite active, going from 
almost —7° to 2°. While this appears small, it should be compared relative to the rudder position limit, which is 
usually reduced as the airspeed and altitude increase. The absolute rudder position limit is ±10° but in practice the 
actual rudder position limit may be less. Therefore, it is usually desired to keep the rudder deflection as small as 
possible. Both the direct MRAC and £\ modification improve the situation somewhat, but the adaptive optimal control 
is able to keep the rudder deflection quite small and virtually almost at trim. 
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Fig. 15 - Rudder Deflection 

To demonstrate stability robustness of the adaptive optimal control modification, the time delay margin (TDM) 
is computed numerically in the simulations as a function of the parameter v. A time delay is introduced between 
the actuators and the damaged aircraft plant model and is adjusted until the adaptive optimal control modification 
algorithm is on the verge of instability. The results are plotted in Fig. 16 for an adaptive gain T = 60. As v increases, 
the time delay margin also increases. This results in a more robust controller that can tolerate a larger time delay which 
acts as a destabilizing disturbance to the controller. However, for the same adaptive gain, increasing v tends to degrade 
the tracking performance. Therefore, in general, v is selected to balance the competing requirements for performance 
and stability robustness that usually exist in a control design. 



Fig. 16 - Estimated Time Delay Margin 

V. Conclusions 

This study presents a new modification to the standard model-reference adaptive control based on an optimal 
control formulation of minimizing the J *?2 norm of the tracking error. The adaptive optimal control modification adds 
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a damping term to the adaptive law that is proportional to the persistent excitation. The modification enables fast 
adaptation without sacrificing robustness. The modification can be tuned using a parameter v to provide a trade-off 
between tracking performance and stability robustness. Increasing v results in better stability margins but reduced 
tracking performance. When v approaches unity, the system is robustly stable with all closed-loop poles having 
negative real values. Simulations of a damaged generic transport aircraft were conducted. The results demonstrate the 
effectiveness of the adaptive optimal modification, which shows that tracking performance can be achieved at a much 
larger adaptive gain than the standard direct model-reference adaptive control. As a result, significant improvements 
in performance can be attained with the adaptive optimal control modification. 
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